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Abstract 



We investigate the process of fixation of advantageous mutations 
in an asexual population. We assume that the effect of each beneficial 
mutation is exponentially distributed with mean value uj me( i = 1/(3. 
The model also considers that the effect of each new deleterious muta- 
tion reduces the fitness of the organism independent on the previous 
number of mutations. We use the branching process formulation and 
also extensive simulations to study the model. The agreement between 
the analytical predictions and the simulational data is quite satisfac- 
tory. Surprisingly, we observe that the dependence of the probability 
of fixation Pf ix on the parameter 0J me d is precisely described by a 
power-law relation, Pfi x ~ ^Zied m ^ ne exponent 7 is an increase func- 
tion of the rate of deleterious mutations U, whereas the probability 
Pfi x is a decreasing function of U. The mean value ujfi x of the benefi- 
cial mutations which reach ultimate fixation depends on U and co m ed- 
The ratio uifix/^med increases as we consider higher values of mutation 
value U in the region of intermediate to large values of co m ed, whereas 
for low u> me d we observe the opposite behavior. 



1 Introduction 

The process of adaptation in evolving populations takes place by the continu- 
ous production of beneficial mutations and the ultimate fixation of these vari- 
ants in the population. This mechanism is relevant not only to improve the 
adaptation of the organisms to the environment, but also to prevent that in an 
environment where a large supply of slightly deleterious mutations persists, 
the population presents a continuous decline of the mean population fitness 
leading to the extinction of the whole population. Besides, beneficial muta- 
tions have a crucial role to permit the adaptation in dynamic environments 
where drastic changes take place. The attempt to understand the dynamics 
of fixation of beneficial mutations stems from the classical population genet- 
ics with the pioneer works of Fisher and Haldane (|Fisher 1922| IFisher 1 930 
Hal dane 1927)) . With the developments in the experimental biology and the 
consequent abundance of data from real biological systems, especially those 
from bacteria (jde Visser et al. 19991 iRozen et al. 20021 IShaver et al. 2002]) 
and viruses populations (jMiralles et al. 1999j ICuevas et al. 200"2|b it is now 
possible to better understand the main mechanisms underlying the pro- 
cess of adaptation in these populations and the rhythm at which it oc- 
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curs. In this sense, recent theoretical developments have contributed to 
the advances in the field ()Gerrish and Lenski 19981 lUrr 20001 IBarton 1995 
IJohnson and Barton 2002) IGerrish 2001j) . For instance, it is known that the 
increase of the supply of beneficial mutations in an asexual population does 
not result in a linear response of the evolutionary process due to the com- 
petition between distinct lineages in order to reach fixation. In an asexual 
population beneficial mutations are fixed sequentially and in the case where 
two or more mutations compete for fixation only one mutation can be kept in 
the population, with the definitive loss of the remaining ones. This process 
is named clonal interference (jHill and Robertson 1966| Campos et al . 2003). 

Here, we consider that beneficial mutations are rare events and so they 
do not compete for fixation with other mutations. Nevertheless, deleterious 
mutations occur at a constant rate U. Deleterious mutations affect dras- 
tically the dynamics of fixation of the beneficial mutations. As previously 
demonstrated ( |Campos 2003| IPeck 199"4|) . the probability of fixation Pf ix of 
advantageous mutants is a decrease function of the mutation rate U. This is 
a consequence of the possibility of occurrence of such beneficial mutation in 
a genome with a large amount of segregated deleterious mutations, i.e., in a 
genome with very low fitness value. In this paper we consider the multiplica- 
tive fitness landscape at which the effect of each new deleterious mutation is 
independent of other genes. 

The paper is organized in the following way: In the next Section we de- 
scribe the model. In Section III we discuss the branching process formulation. 
In Section IV we show our results, and finally In Section V we present our 
conclusions. 



2 The evolutionary model 

The population consists of N asexual haploid organisms that evolve according 
to the Wright-Fisher model. In the model, the individuals in generation t + 1 
are direct descendants of the individuals in time t. The probability that 
an organism is descendant of a particular parent in the previous generation 
is proportional to the parent's fitness. We also assume that the genome 
of an individual is represented by an infinitely large sequence of bits S = 
(si,S2, • • • ,Soo), where the digit s a denotes the state of gene a, which can 
take two distinct values s a = 0,1. The state s a = means that the digit a 
remains in the original state of the ancestor of the population, whereas the 
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Figure 1: Probability of ultimate fixation as a function of the mutation rate 
U. The parameters are N = 1000, /? = 10 and Sd = 0.1. The data points 
correspond to the simulation results over 100, 000 runs, and the thick line is 
the theoretical prediction according to Eq. (fTTj) . 

state s a — 1 means that the nucleotide a has been hit by a mutation. Because 
we assume infinitely large genomes, the probability of reverse mutations is 
negligible, i.e., we consider that only transitions of the type s a = — > 1 can 
take place. 

When a newborn individual arises it acquires the number of mutations 
present in its parent's genome and an additional amount of new delete- 
rious mutations n taken from a Poisson distribution with parameter U, 
where U is the mean number of new mutations per individual per gen- 
eration. The aforesaid model was introduced by Kimura and Watterson 
(JKimura and Crow 1964| IWatterson 1975J) and it is referred to the infinite- 
sites model. The fitness of an individual depends on the total number k of 
mutations in its genome and is given by 



where Sd is the cost associated to each deleterious mutation. This case cor- 




(1) 
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responds to the multiplicative landscape, where each new mutation reduces 
the fitness of the organism by the same factor (Campos et al. 2000). 



For infinitely large populations, the distribution of frequencies of the class 
of individuals with k mutations in the equilibrium regime, which we denote 
by Ck, can be calculated by the following set of equations 

1 t \ U k ~i 

C k = T-,WjCj k> m. (2) 

w m -w k ^ (k-j ! 3 3 K ) 

]=m 

Using the above expression we can recursively calculate the ratios Ck/C m and 
estimate C m from the normalization condition Y^k Ck — 1 (|Colato and Fontanari 2001)) . 
where m is the index of the class of the fittest individuals existing in the pop- 
ulation. In our analysis we always assume m = 0. 

In our simulations, the initial population is distributed in different classes 
of individuals according to the frequencies of equilibrium, given by Eq. (J2J). 
In this case, it is not necessary to wait the population evolves up to reaching 
the stationary regime. 

In the first generation, we randomly select an individual which acquires an 
advantageous mutation with selective effect s b obtained from an exponential 
distribution 

g(s b ) = (3exp(-(3s b ), (3) 

which is the expected distribution, as argued by the extreme value theory 
JGillespie 19911 IQrr 200 3). Whether this mutation happens in a genotype 
with k deleterious mutations its adaptation value increases by a factor (1+Sf>), 
i.e., 

w k = (l + s b )(l-s d ) k . (4) 

The advantageous mutation can be propagated for future generations as soon 
as the individuals which have acquired it replicate. The fitness of those 
individuals carrying the beneficial mutation takes the same form as in Eq. 

(H) 

In our approach, we consider the beneficial mutation to be fixed when 
the genotype that has first acquired it becomes the most-recent common 
ancestor of the whole population (B arton 1995| IJohnson and Barton 2 002 
Campos 2003ilWilke 20031 |Campos et al. 2003| ). 
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Figure 2: Probability of fixation P^ x as a function of the parameter uo m ed 
for fixed values of mutation rate U. The data points are the theoretical 
predictions and the dashed-lines are the best fits which give a power-law 
distribution. The parameter values are Sd = 0.1 and from top to bottom 
U = 0.01, U = 0.1 and U = 0.2. 

3 Branching Process Formulation 

The theory of branching process (|Harris 1963|) was first used in the context 
of population genetics by R. A. Fisher ([Fisher 1922; IFisher 1930|) to study 
the survival of the progeny of a mutant gene and random fluctuations in the 
frequencies of genes. Subsequently, Haldane used the theory to investigate 
the problem of fixation of an advantageous allele (jHaldane 1927ft . 

Haldane demonstrated that the probability ir, that a given genotype with 
selective advantage s reaches fixation in a two-allele model, is given by the 
solution of the following equation (H aldane 1927)) : 



For small selective values, the solution of this equation yields ir(s) ~ 2s. 
Recently, Barton (jBarton 19951 IJohnson and Barton 2002)) extended the 



1 - 7T 



= e 



(1+s)tt 



(5) 
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Figure 3: The mean value of the beneficial effect of the advantageous muta- 
tions that reached fixation Uf ix as a function of ui me d. The parameter values 
are N = 1000, Sd = 0.1 and from top to bottom U = 0.3, U = 0.1 and 
U = 0.01. The data points correspond to the simulational data whereas the 
lines are the numerical solutions of Eq. (|12|) . 



use of the branching process formulation to heterogeneous genetic back- 
ground, where the individuals can also produce offsprings which are not iden- 
tical copies of itself. In the formulation, the probability P, ltt that a beneficial 
mutation reaches fixation when it is present in a single genotype with genetic 
background % (for instance, % denotes the number of mutations) at generation 
t is obtained by iterating the following set of equations: 

oo 

(i-p ht _ 1 ) = j2w t ,(i-p* t y , (6) 

j=0 

where 

P* t = Mi, k P k ,t (7) 
k 

is the probability that an allele in background i at time t — 1 would get fixed, 
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given that at time t it is passed to one offspring, and is an element of 
the mutation matrix that gives the chance that an offspring from a parent 
at background i will be at background k. The quantity W^j is the probabil- 
ity that an allele in background i contributes with j offsprings to the next 
generation. If the distribution of offsprings is given by a Poisson distribution 
with mean £j = Wi/w (where w is the mean fitness population), then 

Wij = |e" 5 % (8) 

and Eq. © is written as 

(1 - P^O = exp [-&P*J . (9) 

The probabilities of fixation correspond to the solution of Eq. © obtained 
in the limit t — > oo, which we denote by Pi = Pi^oo- 

We estimate the probability of fixation of a beneficial mutation with fixed 
selective effect s b , Qfi x (sb), by the sum over the distinct genetic backgrounds 

i 

®fix = J2 p * di - ( 10 ) 

i 

The concentrations of the classes i, Ci, also means the chance of occurrence 
of the beneficial mutation in an individual in background i. 

As we consider that the beneficial effect of an advantageous mutation is 
exponentially distributed according to Eq. (JHJ), the probability of ultimate 
fixation Pf ix of a beneficial mutation is then given by 

POO 

Pfix = / g{s b )Q(s b )ds b . (11) 
Jo 

Another relevant measurement in our statistics is the quantity uifi X , which 
represents the mean value of the beneficial effect of those mutations that 
have reached ultimate fixation, which we calculate as 

1 f°° 

u f™ = d — / s b g(s b )Q(s b )ds b . (12) 

•V fix Jo 

Although we only solve Equation (fT2*|) numerically, an analytical approxi- 
mation can be obtained when we consider small values of the parameter 
&med — and zero mutation rate. In this case, the probability of fixation 
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Figure 4: The ratio Uf ix /u med as a function of oJ me d- The parameter values 
are N = 1000, Sd = 0.1 and from top to bottom U = 0.3, U = 0.1 and 
U = 0.01. The data points correspond to the simulational data whereas the 
lines are the numerical solutions of Eq. (|12|) . 



6(sb) is equal to the solution for the two-allele model (see Eq. (j5J)), i.e., 
6(sb) ~ 2sb- Substituting the expression 0(sb) = 2sb into Eqs. (|TT|) and (fH2j) 

we get 

2 

w /« = -Q, ( 13 ) 
and the ratio between Uf ix and tu me d yields the value 2. 



4 Results and Discussions 

In this section we present and discuss our results from the simulations and 
also from the theoretical formulation. 

In Figure^we plot the probability of ultimate fixation as a function of the 
mutation rate for parameter values (3 = 10 and Sd = 0.1. We obtain a good 
agreement between the simulational data and the theoretical curve obtained 
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integrating numerically Eq. (jllj) . The simulation results were taken over 
100, 000 distinct runs. As expected, we observe a continous decrease of the 
probability Pfi X as we increase the mutation value U . For high mutation rate 
U the agreement between the simulations and the theoretical prediction is 
less satisfactory than those seen for small and intermediate values of U. This 
problem occurs due to the occurrence of the Muller's ratchet phenomenon in 
finite populations, at which the continuous accumulation of deleterious muta- 
tions leads to loss of the best adapted classes of individuals as the population 
evolves. Thus, for very high U the population never reaches the equilibrium 
regime as supposed in the theoretical formulation ( Campo s 2003). 

In Figure 121 we display the probability Pji X as a function of the mean 
value of the distribution of selective effects oJ me d for fixed values of U. The 
probability Pf% x is an increase function of the parameter u> me d since those mu- 
tations with higher selective effect have a greater chance of reaching fixation. 
We observe that this relation is well described by a power-law scaling, i.e., 
Pfi X ~ u med wn ere the exponent 7 is an increase function of the mutation 
rate U, whereas Pf ix decreases with U, as we can see in Figure [TJ 

Figure 01 shows the mean value of the selective advantages of those mu- 
tations that reached fixation u>fi x plotted against the mean value u> m ed- As 
expected Uf ix increases with the augment of u me d, although we do not see a 
linear response. Besides Ufi X increases with the raise of U, and in this case 
U only those mutations awarding a large beneficial effect to the individuals 
have a non-negligible chance to reach fixation. We can better understand 
this scenario in Figure E| where we plot the ratio u)fi X /u) me d as a function of 
oJmed- From the figure, we see that the ratio ujfi X /u me d is an increase function 
of the mutation rate U for about all values of the parameter uj me d- Never- 
theless, an exception is obtained when we consider very small u me d, where 
we observe that when U — > the ratio ojf ix /oj me d — » 2. Moreover, we witness 
that for intermediate to high values of U the ratio uOfi X /uo me d is optimized 
around uj c ~ 0.1. 



5 Conclusions 

We have studied the dynamics of fixation of advantageous mutants in the 
multiplicative fitness landscape. We have investigated the problem by means 
of extensive simulations and also by a theoretical approach where we use the 
branching process formulation introduced by Haldane and extended by Bar- 
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ton to the case of heterogeneous genetic background. The simulation results 
are in very good accordance with the theoretical predictions. The benefi- 
cial effect of these advantageous mutations is assumed to be exponentially 
distributed with mean value cu me d- At once, we witness that the continuous 
supply of deleterious mutations to the population reduces the chance of fix- 
ation of the beneficial variants in the population. We have noticed that the 
relation between the probability of ultimate fixation Pf ix and the parame- 
ter uJmed obeys a power-law scaling like Pfi X ~ ^Ziedi wnere the exponent 7 
depends on the mutation rate U . Surprisingly we have observed that the 
ratio Ufix/umed between the selective advantages of those mutations that 
reached fixation and the mean value of the distribution is optimized around 
the critical value w me d — u c — 0.1 for intermediate to high values of U. In 
this range of U, we have also witnessed that an increase of U also means a 
higher value of ujf ix /uj me d. For very low values of u me d, the branching process 
theory fails to predict ujf ix /uj me d when we consider large values of U: while 
the simulations show that the ratio ujf ix /u me d goes to 1, which corresponds 
to a random stochastic regime, the theoretical analysis gives Uf ix /uj me d = 2, 
which is the same value attained when we have mutation rate U — 0. Actu- 
ally, a smaller value of U means that the random stochastic regime will be 
attained at smaller values of co me d- 
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